Description

The present invention relates to a method for efficiently producing a desired protein. More specifically, the present
invention relates to a method for producing a protein in a bacterial cell as a soluble protein. More precisely, the present
invention relates to the manufacture, as a soluble, active protein, of a protein that is normally expressed in a bacterial
cell as an insoluble, inactive protein; to an expression cassette or an expression vector; and to a transformant used
for the method.

A cell is not always found under ideal conditions. The cell is exposed to various stresses such as changes in
temperature, pH, etc. It is known that when a cell is exposed to a high temperature, the cell produces a group of specific
proteins known as "heat shock proteins" (HSP) (Ellis, R.J. et al. (1990) Molecular Chaperons: The plant connection.
Science 250: 954-959). The HSP described in this publication is known as a molecular chaperon and is constitutively
expressed. The role of the molecular chaperon has been carefully studied and it has been found to be involved in
biological functions common among different species, such as formation and maintenance of the higher structure of
a protein, membrane permeation of a protein, regulation of the cell cycle, origin and differentiation of a cell, and functions
of the immunological system (Zeilstra-Ryalls, J., O. Fayet and C. Georgopoulos (1991) The universally conserved GroE
(Hsp60) chaperonins. Annu. Rev. Microbiol. 45: 301-325; Ellis, R.J. et al. (1991) Molecular Chaperons. Annu. Rev.
Biochem. 60: 321-347). HSPs are classified into the following 5 families by their molecular weights:

1. HSP60 (chaperonin) family: GroEL, Hsp60, Cpn60
2. Hsp70 family: DnaK, Hsp70, BiP
3. Hsp90 family: HtpG, Hsp90, Grp94
4. TRiC family: TF55, TRiC (TCP1)
5. other family: GroES, Hsp28, Hsp45



Recently, an HSP was found to aid in the formation of the conformation and higher structure of a protein even in vitro.
Therefore, the elucidation of the structure and function of such an HSP becomes important. An HSP which is involved in
the conformation and the conformational change of a protein is GroEL. GroEL, when combined with GroES, which has a
subunit molecular weight of 10 kDa, has been shown to aid higher structure formation of various proteins in vivo or
in vitro. GroEL has ATPase activity and has a characteristic 14-mer quaternary structure composed of two 7-mer
doughnut-shaped rings. GroES, like the subunits of GroEL, is considered to be a 7-mer and has a doughnut-like
structure. The 14-mer of GroEL and the 7-mer of GroES form a complex at a 1:1 ratio in vivo and act as a GroE protein
(Figure 19; Yasushi Kamata, BIO VIEW (1993)). Among chaperonin proteins, GroE has been well studied with respect
to its involvement in the formation of protein structure. In the eukaryotic cell, a protein called "t-complex polypeptide-1"
(TCP1) has been found to activate ATP-dependent actin formation or tubulin formation in vitro (Gao et al. (1992)
A cytoplasmic chaperonin that catalyzes β-actin folding. Cell 69: 1043-1050; and Yaffe et al. (1992) TCP1 complex
is a molecular chaperon in tubulin biogenesis. Nature 358: 245-248). Recently, it has been reported that a
hyperthermophilic archaeon has a TCP-1-like molecular chaperon (TF55) (Trent, J.D. et al. (1991) A molecular chaperon from
hyperthermophilic archaebacterium is related to the eukaryotic protein t-complex polypeptide-1. Nature 354:
490-493).

In thermophilic bacteria, all the biopolymers are stabilized in order to tolerate the high temperature. Therefore,
proteins derived from thermophilic bacteria are applied to various fields such as the polymerase chain reaction and
biosensors (Saiki et al. (1988) Primer-directed enzymatic amplification of DNA with a thermostable DNA polymerase.
Science 239: 487-491; and Kagawa et al. (1989) Biotechnological applications of thermophilic ATP synthetase. Membrane
electronics and genetics. J. Membrane Sci. 41: 237-247). Since molecular chaperons from thermophilic and
hyperthermophilic archaebacteria are considered to have high stability, they are extremely useful for studying
mechanisms of higher conformational structure formation, artificially induced formation of a higher structure of a desired
protein, or renaturation.

In the field of genetic engineering, in order to produce a desired protein in a large amount and for efficient recovery,
a bacterial cell is generally used as a host since the bacterial cell is easy to grow and to manipulate. However, in a
bacterial cell, a foreign protein is mostly expressed in an insoluble and inactive form such as an inclusion body. Also,
in the case where a foreign promoter which can function in a bacterial cell is used, the protein expressed is an insoluble,
inactive protein.

The recovered insoluble, inactive protein can then be treated to solubilize and reactivate it. In the case where the
insoluble protein is a heat stable enzyme, a heat treatment is conducted to solubilize the insoluble protein. However,
since recovery is low, a method for expressing a protein in soluble form is required.

In one aspect of the present invention there is provided an expression cassette which can express a desired protein
in a host cell, wherein the cassette comprises a sequence in which a gene encoding a molecular chaperon is operably
linked to a first promoter and a site to which a gene encoding the desired protein can be inserted.

In one embodiment of the present invention, the expression cassette is functional in a bacterial cell.

In one embodiment of the present invention, the expression cassette can express a protein in a soluble form, which
is expressed as an insoluble form in a bacterial cell in the absence of the molecular chaperon.

In one embodiment of the present invention, the expression cassette has a second promoter, and the second
promoter is present upstream of the insertion site and is located so as to promote expression of the inserted gene.

In another embodiment of the present invention, the expression cassette has a terminator sequence downstream
of the gene encoding the molecular chaperon and downstream of the site to which the gene encoding the desired
protein is inserted.

In still another embodiment, the gene encoding the desired protein is inserted in an expressible form.

In still another embodiment, the gene encoding the molecular chaperon is a heat shock protein (HSP) gene of a
hyperthermophilic archaeon KOD-1.

In still another embodiment, the gene encoding the molecular chaperon is a GroESL gene of Bacillus
stearothermophilus SICI.

In still another embodiment, both the first and the second promoter are T7 promoters.

In another aspect of the present invention there is provided an expression vector comprising the above expression 
cassette, the desired gene being operably incorporated into the cloning site. 

The present invention further relates to a cell which can express a desired protein, wherein the cell is transformed
with an expression cassette or an expression vector containing the expression cassette, and wherein the cassette
comprises a sequence in which a gene encoding a molecular chaperon is operably linked to a first promoter and a site
to which a gene encoding the desired protein can be inserted.

In another aspect of the present invention there is provided a cell which can express a desired protein in a soluble 
form, wherein the bacterial cell is co-transformed with a vector which can express a gene encoding a molecular chap- 
eron and a vector which can express a sequence encoding the desired protein. 

In one embodiment of the present invention, the expression cassette is functional in a bacterial cell. 

In one embodiment of the present invention, the expression cassette can express a protein in a soluble form, which 
is expressed as an insoluble form in a bacterial cell in the absence of the molecular chaperon. 

In one embodiment of the present invention, the gene encoding the molecular chaperon is a heat shock protein
gene of a hyperthermophilic archaeon KOD-1.

In another embodiment of the present invention, the gene encoding the molecular chaperon is a GroESL gene of
Bacillus stearothermophilus SICI.

In yet another aspect of the present invention there is provided a method for expressing a desired protein in a 
soluble form, wherein the method comprises a step of culturing a cell which can co-express a gene encoding a molecular 
chaperon and a gene encoding the desired protein. 
In one embodiment of the present invention, the cell is transformed with a vector having a gene encoding a
molecular chaperon operably linked to a first promoter and having a gene encoding the desired protein operably
linked to a second promoter.

In another embodiment of the present invention, the cell is co-transformed with a vector which can express a gene
encoding a molecular chaperon and a vector which can express a sequence encoding the desired protein.

In still another embodiment of the present invention, the gene encoding the molecular chaperon is a heat shock
protein gene of a hyperthermophilic archaeon KOD-1.

In still another embodiment of the present invention, the gene encoding the molecular chaperon is a GroESL gene
of Bacillus stearothermophilus SICI.

In one embodiment of the present invention, the host cell is a bacterial cell.

In one embodiment of the present invention, the desired protein is expressed in a soluble form, which is expressed
as an insoluble form in the absence of the molecular chaperon.

In a further aspect of the present invention there is provided a method for expressing a desired protein in a soluble
form comprising:

culturing a cell having an expression cassette or an expression vector containing a gene encoding a molecular
chaperon and a gene encoding the desired protein and co-expressing the molecular chaperon and the desired
protein;
heating the cell culture broth or a fraction containing the desired protein;
separating an insoluble fraction; and
recovering the desired protein.

In one embodiment of the present invention, the cell is transformed with a vector having a gene encoding a
molecular chaperon operably linked to a first promoter and having a gene encoding the desired protein operably
linked to a second promoter.

EP 0 774 512 A2

In another embodiment of the present invention, the cell is co-transformed with a vector which can express a gene
encoding a molecular chaperon and a vector which can express a sequence encoding the desired protein.

In still another embodiment of the present invention, the gene encoding the molecular chaperon is a heat shock
protein gene of a hyperthermophilic archaeon KOD-1.

In still another embodiment of the present invention, the gene encoding the molecular chaperon is a GroESL gene
of Bacillus stearothermophilus SICI.

In one embodiment of the present invention, the host cell is a bacterial cell.

In one embodiment of the present invention, the desired protein is expressed in a soluble form, which is expressed
as an insoluble form in the absence of the molecular chaperon.

In a further aspect of the present invention there is provided a method for changing a heat labile protein to a heat
stable protein comprising mixing the heat labile protein and a heat stable molecular chaperon.

In a further aspect of the present invention there is provided a method for purifying a heat labile protein comprising:
mixing the heat labile protein and a heat stable molecular chaperon; and heating the mixture.

In a further aspect of the present invention there is provided a heat shock protein of KOD-1 comprising an amino
acid sequence of SEQ ID NO: 7.

In a further aspect of the present invention, there is provided a gene encoding a heat shock protein of KOD-1
comprising an amino acid sequence of SEQ ID NO: 7.

Thus the invention described herein makes possible the advantage of enabling a protein which is expressed in a cell,
for example a bacterial cell, as an insoluble inclusion body to be expressed in the bacterial cell as a soluble protein,
by using vector(s) which can express the molecular chaperon and the desired protein simultaneously, thereby making
it possible to recover the desired protein efficiently.

These and other advantages of the present invention will become apparent to those skilled in the art upon reading
and understanding the following detailed description with reference to the accompanying figures.
Figure 1 shows cloning of a GroESL gene.
Figure 2 is a diagram for preparing a plasmid pET-GroESL.
Figure 3 shows an expression cassette of the present invention.
Figure 4 shows construction of a plasmid pET-sFV.
Figure 5 is a continuation of Figure 4.
Figure 6 shows construction of an expression vector pET-sFV-ESL.
Figure 7 shows that the sFV is solubilized when a molecular chaperon is expressed simultaneously.
Figure 8 shows a restriction map of an EcoRI-HindIII fragment of hyperthermophilic archaebacterium KOD-1 and
binding portions of probes having the sequences of SEQ ID Nos. 5 and 6.
Figure 9 shows a sequence of an HSP gene of hyperthermophilic archaebacterium KOD-1 and the deduced amino acid
sequence (546 amino acids).
Figure 10 is an SDS-PAGE of a protein expressed by a transformant of a plasmid pACYC-KOD Hsp.
Figure 11 shows a gel filtration pattern of the dimer form (120 kDa) and polymer form (about 950 kDa, 16-mer) of
HSPs.
Figure 12 shows the in vitro heat stability of ADH.
Figure 13 shows the heat stability of ADH when HSP is present.
Figure 14 shows the in vitro heat stability of ADH at 50°C.
Figure 15 shows a co-transformation using the expression vector of the present invention.
Figure 16 shows an increase in production of neutral amylase co-expressed with HSP.
Figure 17 shows a solubilization of CobQ when CobQ is co-expressed with HSP.
Figure 18 shows a solubilization of sFv when sFv is co-expressed with HSP.
Figure 19 is a scheme for the formation of a functional protein GroE, which is a complex of GroEL and GroES.
As used herein, "cell" means a prokaryotic cell or eukaryotic cell and includes bacterial cells, yeast cells, plant
cells and mammalian cells.

As used herein, "bacterial cell" means a prokaryotic cell and archaebacterium. As a prokaryotic cell, both gram
positive and gram negative bacterial cells are included.

As used herein, "foreign protein" means a protein which is not naturally found in the host (bacterial) cell. "Foreign
promoter" means a promoter which is not naturally found in the host (bacterial) cell or a promoter which is not a
respective natural promoter for expressing a protein.

As used herein, "soluble" means that substantially no inclusion bodies are found under microscopic observation.

Hereinafter, examples are described with respect to bacterial cells. However, it will be readily apparent to those
skilled in the art that the examples can be applied to yeast cells, plant cells and mammalian cells.

(Expression cassette)

An expression cassette of the present invention comprises a sequence in which a gene encoding a molecular
chaperon is operably linked to a first promoter and a site to which a gene encoding the desired protein can be inserted.
This expression cassette can be made by using a promoter and a gene encoding a molecular chaperon. A terminator
sequence can also be used if necessary. As a promoter, a bacterial promoter and a phage promoter can be used.
Preferably, tac promoter, lac promoter, etc., can be used. Most preferably, T7 promoter can be used for reasons of
high expression.

As a molecular chaperon, heat shock protein (HSP) of a hyperthermophilic archaeon, and GroEL, GroES, Hsp90,
SecB or others derived from thermophilic bacteria such as Bacillus stearothermophilus, can be used. Among these
proteins, HSP of a hyperthermophilic archaeon is preferably used. Archaea is considered to be a taxonomic group different
from prokaryotes or eukaryotes. Interest in archaea, which include hypersalt-tolerant archaea, methane-producing
archaea and hyperthermophilic archaea, concerns the evolutional aspects of the group. The HSP from a
hyperthermophilic archaeon is most preferably used since the HSP is composed of one molecule.

Thermophilic bacteria or thermophilic archaea of the present invention refer to bacteria or archaea which grow
at temperatures exceeding 60°C.

The hyperthermophilic archaeon preferably used in the present invention is defined as growing at temperatures
of more than 80°C.

Among the hyperthermophilic archaea, KOD-1 is preferably used. KOD-1 is a thermophilic thiol protease
producing strain which was isolated from a solfatara wharf on Kodakara Island, Kagoshima, Japan (Appl. Environ. Microbiol.
60(12), pp. 4559-4566 (1994)). KOD-1 is deposited in the National Institute of Bioscience and Human-Technology,
Agency of Industrial Science and Technology as accession No. FERM P-15007. KOD-1 was at first classified into genus
Pyrococcus. However, as described in the reference above, KOD-1 is now considered to be more closely related to
genus Thermococcus than genus Pyrococcus according to the comparison of 16S rRNA sequences.

GroEL and GroES from thermophilic bacteria can be preferably used since GroEL and GroES are known to bind
to a molten globule, which is an intermediate shape of a folded protein, and promote the folding of the protein. The
amino acid sequence and the nucleotide sequence of the Escherichia coli (E. coli) GroEL and GroES are described in
Nature vol. 333, pp. 330-334 (1988).

As a starting vector for construction of the cassette of the present invention, any vector which is stably maintained
and replicated in the bacterial cell can be used. As the starting vector, pBR322, pUC18, pUC119, pET-8c and so on
can be used. Preferably, pET-8c, which has a bacteriophage T7 φ10 promoter, can be used.

An example for construction of the expression cassette will now be described: 

A first promoter is introduced into the starting vector. Then, a gene sequence encoding the molecular chaperon is
ligated downstream of the first promoter. Introduction and ligation of these sequences can be done by a method or
technique which is known to those skilled in the art. In order to obtain optimum expression activity, the distance between
the first promoter sequence and the gene of the molecular chaperon can be regulated. As a molecular chaperon
sequence, a known sequence can be used. An HSP gene, which is a molecular chaperon of the hyperthermophilic archaeon
KOD-1, can be cloned by use of the conserved amino acid sequence of the chaperonin gene of the known HSP gene.
Details of the screening are described in the Example.

A terminator sequence can be positioned downstream of the molecular chaperon. A T7 phage terminator sequence
can be preferably used. The terminator sequence is useful for enhancing expression efficiency. The ligation of the
gene of the molecular chaperon and the terminator sequence can be performed using a method known to those skilled
in the art.

A plasmid having a promoter sequence-molecular chaperon gene-terminator sequence can be constructed by
inserting the molecular chaperon gene in between the promoter sequence and terminator sequence of a plasmid, such
as pET-8c, having the promoter sequence and terminator sequence. The thus obtained molecular chaperon expression
vector can be an expression cassette of the present invention if the vector has a suitable cloning site for a gene of a
desired protein. If the constructed vector does not have a suitable cloning site, a cloning site can be made so as to
construct the expression cassette of the present invention. As the cloning site, a multi-linker which has various
restriction sites can be used. The multi-linker can be purchased from a commercial source or chemically synthesized.

To the cloning site of the expression cassette, a gene encoding the desired protein to which a second promoter is 
operably linked can be introduced. 

In the expression vector, a second promoter sequence is introduced upstream of the cloning site. The introduced
second promoter can be the same as or different from the first promoter. The length of the linker can be regulated
so as to efficiently express the desired protein. A terminator sequence can be introduced downstream of the cloning site.

(Expression vector)

The expression vector of the present invention means a vector into which a gene encoding a desired protein is
incorporated, and includes an expression cassette into which the cloning site for the gene encoding the desired protein
is introduced.

(Transformation) 

A method for transforming a bacterial cell with the expression vector is well known to those skilled in the art. For
example, when Escherichia coli is used as a host, CaCl2 treatment is employed. Screening of the transformant is also
well known to those skilled in the art. The transformant is selected by the use of drug resistance or auxotrophy, with
drug resistance being the generally used method. As the drug resistance gene, an ampicillin, chloramphenicol or
tetracycline resistance gene and so on can be used.

The transformant of the present invention does not always have both a molecular chaperon gene and a desired
protein gene in the same plasmid. The transformant of the present invention can be co-transformed with a vector having
the first promoter and the molecular chaperon gene and a vector having the second promoter and the desired protein
gene. The two vectors used for co-transformation preferably each have a different drug resistance gene for selection.

(Manufacture of a desired protein)

The selected transformant can be cultivated by a method known to those skilled in the art. After the cultivation,
cells are disrupted by a known method such as sonication, treatment with lysozyme, and so on. After centrifugation,
the desired protein can be purified and recovered by a method using, for example, ammonium sulfate, ion exchange
chromatography, column chromatography or affinity chromatography or a combination thereof.

Proteins which are expressed as an inclusion body in the bacterial cell and can be used in the present application
include, but are not limited to, plant proteins, or animal proteins such as antibodies.

Examples: 

(Example 1: construction of expression vector)

As a molecular chaperon, GroESL of Bacillus stearothermophilus SICI (hereinafter referred to as SICI) was used.
The SICI is obtained by culturing a Bacillus stearothermophilus SI1 which was deposited with the National Institute of
Bioscience and Human-Technology, Agency of Industrial Science and Technology under deposition No. FERM P-9629.

In Figure 1, a cloning procedure for the GroESL gene is shown. Chromosomal DNA was isolated by an established
method from the starting material, SICI. The chromosomal DNA was digested with SspI and was circularized. The
circularized DNA was digested with EcoRI and was subjected to PCR by the use of Primers 1 and 2 having the following
sequences:

1: 5'-GTATGCGGATCCTGGGCGGCATGATGTAATCC-3' (SEQ ID No: 1)
   BamHI

2: 5'-GAGCTCGAATTCCGAAGTAGTTTCTTCAAGTTGC-3' (SEQ ID No: 2)
   EcoRI

PCR conditions were: 94°C for 1.5 min; 56°C for 2.5 min; 72°C for 3 min.
The DNA amplified by PCR was digested with BamHI and EcoRI, cloned into pBR322, and digested with EcoRI
(fragment 1).

On the other hand, pUC-groELC was constructed by digesting chromosomal DNA of SICI with EcoRVI and BamHI,
isolating a fragment of about 2.5 kb containing a C-terminal region of GroEL, and cloning it into pUCIG. The pUC-groELC
was digested with EcoRI (fragment 2). The EcoRI fragment (fragment 2) was linked to the EcoRI site of the EcoRI
digested fragment (fragment 1) above, thereby constructing the plasmid pBR-GroESL having the GroESL gene of SICI.
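
The restriction sites labelled under Primers 1 and 2 can be checked directly against the primer sequences. The following is a minimal sketch, assuming the standard recognition sequences GGATCC for BamHI and GAATTC for EcoRI; the names `SITES`, `PRIMERS` and `find_sites` are illustrative only and do not come from the specification.

```python
# Recognition sequences of the two enzymes named under Primers 1 and 2
# (standard sequences, stated here as an assumption of the sketch).
SITES = {"BamHI": "GGATCC", "EcoRI": "GAATTC"}

# Primer sequences as given for SEQ ID No: 1 and SEQ ID No: 2.
PRIMERS = {
    "Primer 1": "GTATGCGGATCCTGGGCGGCATGATGTAATCC",
    "Primer 2": "GAGCTCGAATTCCGAAGTAGTTTCTTCAAGTTGC",
}


def find_sites(seq, sites=SITES):
    """Return {enzyme: [0-based start positions]} for every site found in seq."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)
        if positions:
            hits[enzyme] = positions
    return hits


for name, seq in PRIMERS.items():
    # Print the primer name, its length, and any recognition sites it carries.
    print(name, len(seq), find_sites(seq))
```

Each primer carries its site a few bases from the 5' end, which leaves flanking bases for the enzyme to bind when the PCR product is digested.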

Figure 2 depicts construction of the vector pET-GroESL. The GroESL gene was amplified by PCR using probes P11
and P12 having the following sequences, respectively:

P11: 5'-AGTGCTCTAGAGAACGGCGAAAACTATCG-3' (SEQ ID No: 3)
     XbaI

P12: 5'-TTTTTGGATCCGGTTTATTACATCATGCCGCC-3' (SEQ ID No: 4)
     BamHI

By using the probes above, an XbaI site and a BamHI site were introduced into the GroESL gene. The PCR was done
under the same conditions as above. After amplification by PCR, the gene was digested with restriction enzymes XbaI and
BamHI, and an XbaI-BamHI fragment containing the GroESL gene was recovered.

Plasmid pET-8c, which has a T7 promoter and a T7 terminator, was digested with restriction enzymes XbaI and
BamHI. To the XbaI-BamHI site, the above XbaI-BamHI fragment containing the GroESL gene was ligated, thereby forming
the plasmid pET-GroESL.

The thus obtained pET-GroESL was digested with BglII, blunt ended, and digested with HindIII. Plasmid pET-8c
was digested with a restriction enzyme NheI, blunt ended, and digested with HindIII. These two fragments were ligated
and a multi-linker was introduced at the NcoI-BamHI site, thereby forming the expression cassette depicted in Figure 3.

(Example 2: construction of an expression vector)

This example describes an expression vector of the present application which can co-express a molecular chaperon and
a single strand peptide Fv (sFv) of anti-gp130 antibody GPX7 (a desired protein).

A plasmid pET-sFV having sFv was constructed as depicted in Figures 4 and 5. Plasmid pET-8c was digested
with NcoI, blunt ended with a Klenow fragment, and digested with BamHI. On the other hand, the VL gene and the VH
gene were amplified by PCR using DNA probes as shown in Figure 4, ligated, and digested with the restriction enzymes
FspI and BglII. The obtained fragment was ligated to the above BamHI digested plasmid pET-8c. Then, the obtained
plasmid was digested with the restriction enzymes XhoI and BamHI, and a sFV3 linker having XhoI and BamHI ends
as shown in Figure 5 was ligated to the restriction site, thereby forming the plasmid pET-sFV.

Then, pET-sFV-ESL was constructed by using pET-GroESL and pET-sFV as depicted in Figure 6. Plasmid
pET-GroESL constructed in Example 1 was digested with BglII, blunt ended with a Klenow fragment, digested with HindIII,
and a shorter fragment was recovered. Plasmid pET-sFV was digested with NheI, blunt ended with a Klenow fragment,
digested with HindIII, and a larger fragment was recovered. These fragments were ligated with T4 DNA ligase to
construct a plasmid pET-sFV-ESL. The pET-sFV-ESL has a T7 promoter which is controlled by a lac operator integrated
into the genome of the host cell, and therefore, the expression of the plasmid can be induced by IPTG.

(Example 3: transformation and expression of sFV)

Escherichia coli (E. coli) BL21(DE3) was inoculated in 40 ml LB medium, and cultivated at 37°C for 3 hours. The
cells were harvested and treated with 50 mM CaCl2. One microgram of the pET-sFV-ESL plasmid was added to the
cell suspension and the suspension was then treated at 42°C for 2.5 min. The cells were plated on an LB medium
containing 50 µg/ml ampicillin and cultivated at 37°C for 18 hours, thereby obtaining the transformants.

Transformants containing the pET-sFV-ESL plasmid were cultivated in 100 ml of NZCYM medium at 37°C. The
composition of NZCYM medium is: NZ amine 1%; NaCl 0.5%; yeast extract 0.5%; casamino acids 0.1%; MgSO4·7H2O 0.2%;
pH 7 with NaOH.
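
The percentages in the NZCYM recipe can be turned into weighed amounts for the 100 ml culture. The following is a minimal sketch, assuming the percentages are weight/volume (1% = 1 g per 100 ml), which the text does not state explicitly, and reading the component list as NZ amine 1%, NaCl 0.5%, yeast extract 0.5%, casamino acids 0.1% and MgSO4·7H2O 0.2%. The names `NZCYM_PERCENT_WV` and `grams_needed` are hypothetical.

```python
# NZCYM components and their percentages, read as weight/volume (assumption).
NZCYM_PERCENT_WV = {
    "NZ amine": 1.0,
    "NaCl": 0.5,
    "yeast extract": 0.5,
    "casamino acids": 0.1,
    "MgSO4.7H2O": 0.2,
}


def grams_needed(percent_wv, volume_ml):
    """Convert % (w/v) to grams: 1% (w/v) is 1 g per 100 ml of medium."""
    return {name: pct * volume_ml / 100.0 for name, pct in percent_wv.items()}


# Amounts for the 100 ml culture volume used in this example.
for name, grams in grams_needed(NZCYM_PERCENT_WV, 100).items():
    print(f"{name}: {grams:.2f} g")
```

Passing a different `volume_ml` scales the recipe linearly; the pH adjustment with NaOH is not modelled.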

After 2 hours, the transformants were induced with 0.1 mM IPTG for 3 hours. After induction, the transformants were
centrifuged, washed with 30 mM Tris-NaCl buffer (pH 8.0), resuspended in 1 ml of the same buffer and then lysed by
sonication. The lysate was centrifuged to obtain a supernatant fraction (soluble fraction) and a precipitate fraction
(insoluble fraction).

The insoluble fraction was dissolved with Triton X-100. The dissolved fraction of the precipitate was subjected to
SDS-PAGE in order to detect the expressed sFV. As controls, soluble and insoluble fractions from E. coli BL21(DE3)
transformed with a plasmid pET-sFV were used. Under microscopic observation, inclusion bodies were not substantially
found in transformants having the plasmid pET-sFV-ESL; however, inclusion bodies were found in transformants having
the plasmid pET-sFV.

Figure 7 shows a result of an SDS-PAGE of sFV obtained from each transformant. The left column is a control.
As clearly shown in the figure, substantially no sFV was found in the soluble fraction but sFV was found in the insoluble
fraction. The right column is a case where sFV and the molecular chaperon were co-expressed. Almost all sFV was found
in the soluble fraction and a small amount of sFV was found in the insoluble fraction.

(Example 4: cloning of HSP gene from KOD-1)

KOD-1 was cultivated in a 2 litre fermentation jar. KOD-1 was inoculated in 1 litre of 0.5x 2216 marine broth medium
(2216 marine broth 18.7 g/L; PIPES 3.48 g/L; CaCl2·H2O g/L; 0.4 mL of 0.2% resazurin; 475 mL of artificial sea water
(NaCl 28.16 g/L; KCl 0.7 g/L; MgCl2·6H2O 5.5 g/L; MgSO4·7H2O 6.9 g/L); 500 mL of distilled water; pH 7.0) as described
in Appl. Environ. Microbiol. 60(12), pp. 4559-4566 (1994). The air in the jar was replaced by nitrogen gas and the inner
pressure was maintained at 0.1 kg/cm². The culture was grown at 85±1°C for 14 hours, without agitation or bubbling
with nitrogen gas. After cultivation, the culture broth (about 1,000 ml) was centrifuged at 10,000 rpm for 10 min. The
cells were harvested.

Chromosomal DNA was extracted according to a well known method and digested with EcoRI and HindIII.
Fragments of about 4.5 kb were isolated, ligated to pUC18 and transformed into E. coli JM109, thereby preparing the gene
library. This library was used to clone an HSP gene of hyperthermophilic archaeon KOD-1. Although a promoter
of the hyperthermophilic archaeon cannot work well in E. coli, the cloned gene of the hyperthermophilic archaeon can
be expressed since pUC18 has a lac promoter just upstream of the cloning site. Furthermore, since the 16S rRNA
binding sequence which is necessary for translating the hyperthermophilic archaeon gene can be functional in E. coli,
the cloned hyperthermophilic archaeon gene can be expressed.

Cloning probes were prepared in consideration of the conserved amino acid sequence of the chaperonins
which are encoded by known HSP genes. The sequences of the probes were:

PN: 5'-GGGNGTACCACNATHACNAAYGAYGGNGC-3' (SEQ ID No: 5)

PC: 5'-GGCATNCCRAARAGGATHGARAAYGC-3' (SEQ ID No: 6)

wherein N is one of G, A, T, or C; Y is T or C; H is A, T, or C; and R is G or A.
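
Because probes PN and PC contain the degenerate symbols N, Y, H and R, each probe stands for a pool of concrete oligonucleotides. The following is a minimal sketch of how the pool size can be counted and the pool enumerated, using only the symbol definitions given above; the names `CODES`, `degeneracy` and `expand` are illustrative and not part of the specification.

```python
from itertools import product

# Degenerate symbols exactly as defined in the text; plain bases map to themselves.
CODES = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "GATC",  # G, A, T or C
    "Y": "TC",    # T or C
    "H": "ATC",   # A, T or C
    "R": "GA",    # G or A
}

PN = "GGGNGTACCACNATHACNAAYGAYGGNGC"  # SEQ ID No: 5
PC = "GGCATNCCRAARAGGATHGARAAYGC"     # SEQ ID No: 6


def degeneracy(probe):
    """Number of distinct sequences represented by a degenerate probe."""
    count = 1
    for symbol in probe:
        count *= len(CODES[symbol])
    return count


def expand(probe):
    """Yield every concrete sequence encoded by the degenerate probe."""
    for combo in product(*(CODES[symbol] for symbol in probe)):
        yield "".join(combo)


print("PN:", len(PN), "nt,", degeneracy(PN), "sequences")
print("PC:", len(PC), "nt,", degeneracy(PC), "sequences")
```

`expand` is a generator, so the pool for a highly degenerate probe can be scanned without holding every sequence in memory.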

These two probes were used to screen HSP gene of Hyperthermophilic archaeon. Seven positive colonies were 
selected by colony hybridization from about 1,000 transformants. Southern hybridization was performed as a second 
screening Figure 8 shows a restriction enzyme map of the obtained 4 5 kb fragment. The probes were hybridized to 
a site, which is shown by oblique lines 

PCR was performed using the sequences of SEQ ID Nos: 5 and 6. DNA sequences were determined by the dideoxy
chain termination method using a fluorescence-labeled primer (AutoRead™, Pharmacia, Uppsala, Sweden). The DNA
sequence data were analyzed using DNASIS™ (Hitachi Software).

The HSP gene sequence and the deduced amino acid sequence (546 amino acids) of hyperthermophilic archaeon
KOD-1 are shown in Figure 9 (SEQ ID No: 7). An SD sequence was found upstream of the initiation codon. Figure 1
is a comparison of the homology of the amino acid sequence between HSPs of KOD-1 and other strains.

Table 1

Amino acid sequence comparisons (%)

             TF55    TCPE mouse    TCPA yeast    TCPA human    Bs groEL    Bs dnaK
KOD-1 HSP    56.3    42.8          38.4          39.4          21.1        10.0

TF55: Sulfolobus shibatae thermophile factor 55
TCPE mouse: mouse t-complex protein E unit
TCPA yeast: Saccharomyces cerevisiae t-complex protein alpha unit
TCPA human: human t-complex protein alpha unit
Bs groEL: Bacillus stearothermophilus groEL
Bs dnaK: Bacillus stearothermophilus dnaK



As shown in Table 1, the HSP of KOD-1 has a high amino acid homology of 56.3% with TF55 of Sulfolobus shibatae,
and an amino acid homology of 38.4% and 42.8% with the t-complex polypeptide-1 of yeast and mouse, respectively.
Only 21.1% homology was found with GroEL of B. stearothermophilus SIC1.

(Example 5: construction of expression cassette of HSP gene of KOD-1)

The primers

Pk1: AGGGG CCATGG CCCAGCTCGCAGGCCAGC (SEQ ID NO: 8) and
           NcoI

Pk2: AAAAG GGATCC AAGGTCATCAGTCAAGG (SEQ ID NO: 9)
           BamHI

were used as amplification primers for PCR of the HSP gene of KOD-1. The PCR conditions were: 94°C for 1.5 min;
56°C for 2.5 min; 72°C for 3 min.
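
The restriction sites labelled under the two primers can be located programmatically. The sketch below is only an illustration: `find_sites` is a hypothetical helper, and the recognition sequences CCATGG (NcoI) and GGATCC (BamHI) are the standard ones for these enzymes.

```python
# Standard recognition sequences of the two enzymes named under the primers.
SITES = {"NcoI": "CCATGG", "BamHI": "GGATCC"}

PRIMERS = {
    "Pk1": "AGGGGCCATGGCCCAGCTCGCAGGCCAGC",  # SEQ ID NO: 8
    "Pk2": "AAAAGGGATCCAAGGTCATCAGTCAAGG",   # SEQ ID NO: 9
}


def find_sites(sequence: str) -> dict:
    """Return the 0-based position of each recognition site, or -1 if absent."""
    return {enzyme: sequence.find(site) for enzyme, site in SITES.items()}


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

Note that the NcoI site in Pk1 contains an ATG triplet, which is why this site is convenient for cloning a gene at its initiation codon.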

The obtained gene was digested with NcoI and BamHI and ligated to the NcoI-BamHI site of pET-8c, thereby
forming a plasmid pET-KOD Hsp. The plasmid pET-KOD Hsp has a T7 promoter which is controlled by a lac operator
integrated into the genome of the host cell; therefore, expression of the plasmid pET-KOD Hsp can be induced by IPTG.

(Example 6: purification of HSP) 

Plasmid pET-KOD Hsp was transformed into E. coli BL21(DE3). Transformants were cultured at 37°C in NZCYM
medium and induced with 0.1 mM IPTG for 3 hours. The cells were then centrifuged, suspended
in TE50-1 buffer (50 mM Tris-HCl, pH 8.0, 1 mM EDTA) and sonicated on ice at intervals of 30 × 40 seconds and 20 seconds.
The treated cells were centrifuged at 4°C at 6,000 rpm for 10 minutes, and the soluble fraction and insoluble
fraction were separated. The soluble fraction was treated at 4°C with 80% ammonium sulfate overnight, and the
precipitate was collected by centrifugation at 8,000 rpm for 20 min. The precipitate was re-suspended in TE50-1 buffer and dialyzed
overnight. The dialyzed fraction was heat treated at 94°C for 20 min and centrifuged at 12,000 rpm for 20 min at 4°C.
Proteins were purified by HiTrap DEAE anion exchange chromatography (FPLC system, Pharmacia, Sweden)
with a two-solvent system at a rate of 1 ml/min. Solvent A was 50 mM phosphate buffer, pH 6.2, and solvent B contained
1.5 M NaCl in solvent A. HSP was eluted at 0.5 M NaCl and showed a single band of 60 kDa in SDS-PAGE (Figure 10).
Gel filtration (Superdex 200 HR 10/30, FPLC system, Pharmacia) showed a dimer form
(120 kDa) and a polymer form (about 950 kDa; 16-mer) (Figure 11).
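
The assignment of the two gel filtration peaks to a dimer and a 16-mer follows from dividing the apparent native mass by the 60 kDa subunit mass seen in SDS-PAGE. A minimal sketch of that arithmetic is given below; `subunit_count` is a hypothetical helper, and rounding to the nearest integer is an assumption about how the estimate is read.

```python
def subunit_count(native_kda: float, subunit_kda: float) -> int:
    """Estimate the number of subunits in an oligomer from its apparent mass."""
    return round(native_kda / subunit_kda)


if __name__ == "__main__":
    subunit = 60.0  # kDa, single band in SDS-PAGE
    for native in (120.0, 950.0):  # kDa, gel filtration peaks
        print(f"{native:.0f} kDa -> about {subunit_count(native, subunit)} subunits")
```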

(Example 7: Increase of heat stability of alcohol dehydrogenase (ADH) by using Heat Shock Protein)

In order to investigate the function of HSP as a molecular chaperon, the heat stability of ADH was investigated
in vitro under heat stress. Figure 12 shows the heat stability of ADH in vitro. ADH had a maximum activity at about 30°C;
the ADH activity decreased rapidly at about 50°C, and ADH was substantially inactivated at about 70°C. However, in the
case where ADH and HSP co-existed, the rate of decrease of ADH activity at high temperature was slowed (Figure
13). This result suggested that HSP binds to thermally unfolded or partially folded ADH. The function of the chaperon,
i.e., to maintain the enzymatically active state of ADH in vitro at 50°C, was reproduced. The result is shown in Figure 15.
After treatment at 50°C for 20 min, the remaining ADH activity was about 11% without HSP, whereas the remaining
ADH activity was about 100% in the presence of HSP (0.25 µM). With increasing HSP concentration, the effect
became more remarkable (Figure 14). Further, even at a low concentration of HSP, ATP could increase the heat stability
of ADH; however, in the presence of a high concentration of HSP, ATP did not affect heat stability. Although both the
dimer and polymer forms of HSP had chaperon activity (data not shown), the polymer form of HSP showed a much
greater effect than the dimer form.

ADH activity was assayed by monitoring the change in absorbance at 340 nm caused by the ethanol-dependent reduction
of NAD. ADH activity was expressed as µmoles of NADH produced per minute, calculated with a molar extinction
coefficient of 6.22 mM⁻¹ cm⁻¹. The standard ADH assay was performed at 25°C in a mixture of 100 mM glycine-KOH buffer (pH 8.8)
containing 1 mM NAD and 100 mM ethanol. A Shimadzu UV-visible recording spectrophotometer UV-160 was used to
determine the absorbance at 340 nm.
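
The conversion from an absorbance change at 340 nm to µmoles of NADH per minute can be sketched with the Beer-Lambert law and the extinction coefficient quoted above. The function names, the 1 cm path length and the example readings below are assumptions made for illustration; they are not taken from the assay description.

```python
EPSILON_NADH = 6.22  # mM^-1 cm^-1 at 340 nm, as quoted in the text


def adh_activity(delta_a340_per_min: float, volume_ml: float,
                 path_cm: float = 1.0) -> float:
    """Return µmol of NADH formed per minute in the cuvette.

    delta_a340_per_min / (epsilon * path) gives the rate in mM/min;
    multiplying by the reaction volume in ml converts mM to µmol.
    """
    rate_mm_per_min = delta_a340_per_min / (EPSILON_NADH * path_cm)
    return rate_mm_per_min * volume_ml


def residual_activity(after: float, before: float) -> float:
    """Remaining activity after heat treatment, as a percentage."""
    return 100.0 * after / before


if __name__ == "__main__":
    # Hypothetical reading: A340 rises by 0.622 per minute in a 1 ml assay.
    print(adh_activity(0.622, 1.0))
```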

(Example 8: construction of plasmids for co-expression and transformation)

Since the effect of HSP on protein stabilization was confirmed in Example 7, a co-expression system (plasmid)
was constructed for expressing HSP and the desired protein simultaneously.

Plasmid pET-KOD Hsp as obtained in Example 5 was digested with restriction enzymes BamHI and BglII, and a
DNA fragment having a T7 promoter-KOD Hsp-T7 terminator was obtained. This fragment was introduced into the
BamHI site of pACYC184, which is compatible with a series of pET vectors, thereby constructing a plasmid
pACYC-KOD Hsp having a chloramphenicol resistance gene.

The thus obtained plasmid pACYC-KOD Hsp and pET-8c were co-transformed into E. coli BL21(DE3). The
transformants were screened for resistance to both ampicillin and chloramphenicol. The HSPs were purified according to
the same method as in Example 6; they showed a single band of MW 60 kDa in SDS-PAGE, and a polymer form of HSP of
about 950 kDa was detected. As this result confirmed, co-transformation of plasmid pACYC-KOD Hsp
and a series of pET vectors is possible; it is therefore possible to co-express the HSP and the desired protein by incorporating
the gene of the desired protein into a cloning site of pET-8c, as shown in Figure 15.

(Example 9; co-expression of HSP and neutral amylase of KOD-1) 



In the co-expression system obtained in Example 8, HSP and neutral amylase of KOD-1 were co-expressed. The
neutral amylase of KOD-1 was screened as follows:

Chromosomal DNA of KOD-1 as obtained in Example 4 was digested with EcoRI. Fragments of about 7.5 kb were
isolated and inserted into the EcoRI site of pUC18, which was transformed into E. coli JM109, and the gene library was prepared.
The transformants were grown on a starch azure agar (L-agar containing starch azure at a final concentration of 0.25%;
an amylase activity indicating medium) containing ampicillin and heat treated at 60°C overnight, and a characteristic
halo-forming colony was selected. The amylase obtained from this colony was confirmed to be neutral amylase by its
optimum pH of 5.0 to 7.0.

The cloned DNA fragment was isolated from the transformant and its DNA sequence was determined. The DNA
was amplified by PCR using the following primers:

SD2: 5'-TGGTA CCATGG CAAAGTATTCCGAACTCGA-3' (SEQ ID No: 10)
              NcoI

E5: 5'-C GGATCC GATATCAGCTATGACCTTTA-3' (SEQ ID No: 11)
         BamHI

The neutral amylase gene obtained was digested with NcoI and BamHI and incorporated into the NcoI-BamHI
site of pET-8c, thereby constructing a plasmid pET-NAmy. Plasmids pACYC-KOD Hsp and pET-NAmy were
co-transformed into E. coli BL21(DE3), and a strain resistant to both ampicillin and chloramphenicol was selected. The
transformants were cultured in NZCYM medium in the same manner as in Example 6 and induced by IPTG for 3 hours. As
a control, E. coli BL21(DE3) transformed with pET-NAmy alone was used. The results are shown in Figure 16. The
neutral amylase aggregates at pH 5.0 but is soluble at pH 8.0. The transformants were sonicated at pH 5.0 and pH
8.0, and the cells were fractionated. SDS-PAGE and activity staining showed an increase of amylase production per
cell in the co-transformed cells at pH 5.0. Further, an increase of amylase expression in the soluble fraction was
recognized at pH 8.0.



(Example 10: co-expression of cobyric acid synthetase (CobQ) and HSP) 



When CobQ of KOD-1 is expressed in E. coli, soluble CobQ and insoluble inclusion bodies of CobQ are
expressed in equal amounts.

Analysis of the genome of KOD-1 and comparison with sequences of Salmonella and Pseudomonas revealed
that the CobQ gene was included in a 4.5 kb HindIII fragment. The CobQ gene was amplified by PCR using the following
two primers:

COB-1: 5'-GTGA CCATGG GAAAGGCGCTGATGGTTCA (SEQ ID No: 12)
               NcoI

COB-2: 5'-CTA GGATCC AAGTCTCTGGATTATGTACTGGA (SEQ ID No: 13)
              BamHI
The obtained CobQ gene was digested with NcoI and BamHI and cloned into the NcoI-BamHI site of pET-8c, thereby
constructing a plasmid pET-CobQ. Plasmids pACYC-KOD Hsp and pET-CobQ were co-transformed into E. coli BL21(DE3),
and ampicillin and chloramphenicol resistant transformants were selected. The transformants were cultivated
in NZCYM medium as described in Example 6 and induced by IPTG for 3 hours. As a control, E. coli BL21(DE3)
transformed with pET-CobQ alone was used. The results are shown in Figure 17. As can be seen in the figure, the number
of insoluble inclusion bodies of CobQ decreased when CobQ was co-expressed with HSP, and the expression of soluble CobQ
increased.

(Example 11: co-expression of sFV and HSP) 

Plasmid pACYC-KOD Hsp and plasmid pET-sFV prepared in Example 2 were co-transformed into E. coli BL21(DE3).
The transformant was cultured in NZCYM medium. IPTG was added when the O.D. at 660 nm of the culture medium reached
0.3 or 1.0, and the culture was induced for 1 or 5 hours. By co-expression with HSP, sFV was produced in the soluble
fraction (Figure 18).

Various other modifications will be apparent to and can be readily made by those skilled in the art without departing
from the scope and spirit of this invention. Accordingly, it is not intended that the scope of the claims appended hereto
be limited to the description as set forth herein, but rather that the claims be broadly construed.

Sequence Listing 
SEQ ID NO: 1
LENGTH: 32
SEQUENCE TYPE: Nucleic acid

STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

GTATG CGGAT CCTGG GCGGC ATGAT GTAAT CC 32 

SEQ ID NO: 2 
LENGTH: 34 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS: Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

GAGCT CGAAT TCCGA AGTAG TTTCT TCAAG TTGC 34 

SEQ ID NO: 3
LENGTH: 29

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

AGTGC TCTAG AGAAC GGCGA AAACT ATCG 

SEQ ID NO: 4 
LENGTH: 32 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

TTTTT GGATC CGGTT TATTA CATCA TGCCG CC 

SEQ ID NO: 5 
LENGTH: 29 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

GGGNG TACCA CNATH ACNAA YGAYG GNGC 

SEQ ID NO: 6 
LENGTH: 26 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS: Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE:

GGCAT NCCRA ARAGG ATHGA RAAYG C 26

SEQ ID NO: 7
LENGTH: 1800

SEQUENCE TYPE: Nucleic acid
STRANDNESS: Both
TOPOLOGY: Unknown
MOLECULAR TYPE: Genomic DNA 
SEQUENCE: 

GCTTTTAATC ATTACCGAAA ACTTTATAAA TAGCACAAAA  -81
GAACAATAGC GCGGAAAACA CGAATTGTAA CTAAAACTCA  -41
TCCACCCTCA AAAACAAAAA AAGGGTGGGG GTGAGGGGAG ATG GCC  6
                                            Met Ala
                                            1

CAG CTC GCA GGC CAG CCA GTT GTT ATT CTG CCC GAG GGA  45
Gln Leu Ala Gly Gln Pro Val Val Ile Leu Pro Glu Gly
        5                   10                  15

ACC CAG AGG TAT GTT GGA AGG GAC GCC CAG AGG CTC AAC  84
Thr Gln Arg Tyr Val Gly Arg Asp Ala Gln Arg Leu Asn
                20                  25

ATT CTT GCT GCC AGG ATT ATA GCC GAG ACG GTT AGA ACC  123
Ile Leu Ala Ala Arg Ile Ile Ala Glu Thr Val Arg Thr
    30                  35                  40

ACC CTC GGT CCA AAG GGA ATG GAC AAG ATG CTC GTT GAC  162
Thr Leu Gly Pro Lys Gly Met Asp Lys Met Leu Val Asp
            45                  50

AGC CTC GGC GAC ATC GTC ATC ACC AAC GAC GGT GCA ACC  201
Ser Leu Gly Asp Ile Val Ile Thr Asn Asp Gly Ala Thr
55                  60                  65

ATT CTC GAC GAG ATG GAC ATC CAG CAC CCT GCT GCT AAG  240
Ile Leu Asp Glu Met Asp Ile Gln His Pro Ala Ala Lys
        70                  75                  80

ATG ATG GTT GAG GTT GCT AAG ACT CAG GAC AAG GAG GCC  279
Met Met Val Glu Val Ala Lys Thr Gln Asp Lys Glu Ala
                85                  90

GGT GAC GGA ACC ACC ACT GCC GTT GTC ATC GCC GGT GAG  318
Gly Asp Gly Thr Thr Thr Ala Val Val Ile Ala Gly Glu
    95                  100                 105

CTT CTG AGG AAG GCT GAG GAG CTT CTC GAC CAG AAC ATT  357
Leu Leu Arg Lys Ala Glu Glu Leu Leu Asp Gln Asn Ile
            110                 115

CAC CCG AGC ATA ATC ATC AAG GGT TAC GCC CTC GCG GCA  396
His Pro Ser Ile Ile Ile Lys Gly Tyr Ala Leu Ala Ala
120                 125                 130

GAG AAA GCC CAG GAA ATA CTC GAC GAG ATA GCC AAG GAC  435
Glu Lys Ala Gln Glu Ile Leu Asp Glu Ile Ala Lys Asp
        135                 140                 145

GTT GAC GTC GAG GAC AGG GAG ATT CTC AAG AAG GCC GCG  474
Val Asp Val Glu Asp Arg Glu Ile Leu Lys Lys Ala Ala
                150                 155

GTC ACC TCC ATC ACC GGA AAG GCT GCC GAG GAG GAG AGG  513
Val Thr Ser Ile Thr Gly Lys Ala Ala Glu Glu Glu Arg
    160                 165                 170

GAG TAC CTC GCT GAG ATA GCA GTT GAG GCC GTC AAG CAG  552
Glu Tyr Leu Ala Glu Ile Ala Val Glu Ala Val Lys Gln
            175                 180

GTT GCC GAG AAG GTT GGC GAG ACC TAC AAG GTC GAC CTC  591
Val Ala Glu Lys Val Gly Glu Thr Tyr Lys Val Asp Leu
185                 190                 195

GAC AAC ATC AAG TTC GAG AAG AAG GAA GGT GGA AGC GTC  630
Asp Asn Ile Lys Phe Glu Lys Lys Glu Gly Gly Ser Val
        200                 205                 210

AAG GAC ACC CAG CTC ATA AAG GGT GTC GTC ATC GAC AAG  669
Lys Asp Thr Gln Leu Ile Lys Gly Val Val Ile Asp Lys
                215                 220

GAG GTC GTC CAC CCA GGC ATG CCG AAG AGG GTC GAG GGT  708
Glu Val Val His Pro Gly Met Pro Lys Arg Val Glu Gly
    225                 230                 235

GCT AAG ATC GCC CTC ATC AAC GAG GCC CTC GAG GTC AAG  747
Ala Lys Ile Ala Leu Ile Asn Glu Ala Leu Glu Val Lys
            240                 245

GAG ACC GAG ACC GAC GCC GAG ATC AGG ATC ACC AGC CCG  786
Glu Thr Glu Thr Asp Ala Glu Ile Arg Ile Thr Ser Pro
250                 255                 260

GAG CAG CTC CAG GCC TTC CTT GAG CAG GAG GAG AAG ATG  825
Glu Gln Leu Gln Ala Phe Leu Glu Gln Glu Glu Lys Met
        265                 270                 275

CTC AGG GAG ATG GTC GAC AAG ATC AAG GAG GTC GGC GCG  864
Leu Arg Glu Met Val Asp Lys Ile Lys Glu Val Gly Ala
                280                 285

AAT GTC GTC TTC GTC CAG AAG GGC ATT GAC GAC CTC GCC  903
Asn Val Val Phe Val Gln Lys Gly Ile Asp Asp Leu Ala
    290                 295                 300

CAG CAC TAC CTT GCC AAG TAC GGC ATA ATG GCC GTT AGA  942
Gln His Tyr Leu Ala Lys Tyr Gly Ile Met Ala Val Arg
            305                 310

AGG GTC AAG AAG AGC GAC ATG GAG AAG CTC GCC AAG GCC  981
Arg Val Lys Lys Ser Asp Met Glu Lys Leu Ala Lys Ala
315                 320                 325

ACC GGC GCC AAG ATC GTC ACC AAC GTC CGC GAC CTC ACT  1020
Thr Gly Ala Lys Ile Val Thr Asn Val Arg Asp Leu Thr
        330                 335                 340

CCG GAG GAC CTC GGT GAG GCC GAG CTC GTC GAC CAG AGG  1059
Pro Glu Asp Leu Gly Glu Ala Glu Leu Val Asp Gln Arg
                345                 350

AAG GTC GCC GGC GAG AAC ATG ATC TTC GTC GAG GGC TGC  1098
Lys Val Ala Gly Glu Asn Met Ile Phe Val Glu Gly Cys
    355                 360                 365

AAG AAC CCG AAG GCC GTC ACA ATA CTC ATC AGG GGC GGC  1137
Lys Asn Pro Lys Ala Val Thr Ile Leu Ile Arg Gly Gly
            370                 375

ACC GAG CAC GTC GTT GAT GAG GTC GAG AGG GCC CTT GAG  1176
Thr Glu His Val Val Asp Glu Val Glu Arg Ala Leu Glu
380                 385                 390

GAC GCC GTC AAG GTC GTC AAG GAC ATC GTC GAG GAC GGC  1215
Asp Ala Val Lys Val Val Lys Asp Ile Val Glu Asp Gly
        395                 400                 405

AAG ATC GTC GCC GCC GGT GGT GCT CCG GAG ATC GAG CTC  1254
Lys Ile Val Ala Ala Gly Gly Ala Pro Glu Ile Glu Leu
                410                 415

GCC ATC AGG CTC GAC GAG TAC GCG AAG GAG GTC GGC GGC  1293
Ala Ile Arg Leu Asp Glu Tyr Ala Lys Glu Val Gly Gly
    420                 425                 430

AAG GAG CAG CTC GCC ATC GAG GCC TTT GCC GAG GCC CTC  1332
Lys Glu Gln Leu Ala Ile Glu Ala Phe Ala Glu Ala Leu
            435                 440

AAG GTC ATC CCG AGG ACC CTC GCC GAG AAC GCC GGT CTC  1371
Lys Val Ile Pro Arg Thr Leu Ala Glu Asn Ala Gly Leu
445                 450                 455

GAC CCG ATC GAG ACC CTC GTT AAG GTC ATC GCC GCC CAC  1410
Asp Pro Ile Glu Thr Leu Val Lys Val Ile Ala Ala His
        460                 465                 470

AAG GAG AAG GGA CCG ACC ATC GGT GTT GAC GTC TTC GAG  1449
Lys Glu Lys Gly Pro Thr Ile Gly Val Asp Val Phe Glu
                475                 480

GGC GAG CCG GCC GAC ATG CTC GAG CGC GGC GTT ATC GCC  1488
Gly Glu Pro Ala Asp Met Leu Glu Arg Gly Val Ile Ala
    485                 490                 495

CCG GTC AGG GTT CCG AAG CAG GCC ATC AAG AGC GCC AGC  1527
Pro Val Arg Val Pro Lys Gln Ala Ile Lys Ser Ala Ser
            500                 505

GAG GCT GCC ATA ATG ATC CTC AGG ATC GAC GAC GTC ATC  1566
Glu Ala Ala Ile Met Ile Leu Arg Ile Asp Asp Val Ile
510                 515                 520

GCC GCC AGC AAG CTC GAG AAG GAC AAG GAG GGC GGC AAG  1605
Ala Ala Ser Lys Leu Glu Lys Asp Lys Glu Gly Gly Lys
        525                 530                 535

GGC GGT AGC GAG GAT TTC GGA AGC GAC CTT GAC  1638
Gly Gly Ser Glu Asp Phe Gly Ser Asp Leu Asp
                540                 545 546

TGAAGCCCTT TGATTTCTTT TCTCTTCAAA TTTGTGTTCT TA  1680



SEQ ID NO: 8 
LENGTH: 29 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

AGGGG CCATG GCCCA GCTCG CAGGC CAGC 29



SEQ ID NO: 9 
LENGTH: 28 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY : Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

AAAAG GGATC CAAGG TCATC AGTCA AGG 28 



SEQ ID NO: 10 
LENGTH: 30

SEQUENCE TYPE: Nucleic acid 
STRANDNESS: Single
TOPOLOGY: Linear

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

TGGTA CCATG GCAAA GTATT CCGAA CTCGA 30 

SEQ ID NO: 11 
LENGTH: 27 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY : Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE : 

CGGAT CCGAT ATCAG CTATG ACCTT TA 27 



SEQ ID NO: 12 
LENGTH: 29 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY : Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

GTGAC CATGG GAAAG GCGCT GATGG TTCA 29 



SEQ ID NO: 13 

LENGTH: 32 

SEQUENCE TYPE: Nucleic acid 
STRANDNESS : Single 
TOPOLOGY: Linear 

MOLECULAR TYPE: Other nucleic acid, synthetic DNA 
SEQUENCE: 

CTAGG ATCCA AGTCT CTGGA TTATG TACTG GA 32 
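
The declared LENGTH of each synthetic oligonucleotide in the listing above can be cross-checked against its sequence. The sketch below is a minimal consistency check under stated assumptions: `check_lengths` is a hypothetical helper, the sequences are transcribed from the listing entries for SEQ ID NOs 1 to 6 and 8 to 13, and the allowed alphabet is limited to the bases and degenerate codes used in this document.

```python
# SEQ ID NO -> (declared LENGTH, sequence as printed in blocks of five).
LISTING = {
    1: (32, "GTATG CGGAT CCTGG GCGGC ATGAT GTAAT CC"),
    2: (34, "GAGCT CGAAT TCCGA AGTAG TTTCT TCAAG TTGC"),
    3: (29, "AGTGC TCTAG AGAAC GGCGA AAACT ATCG"),
    4: (32, "TTTTT GGATC CGGTT TATTA CATCA TGCCG CC"),
    5: (29, "GGGNG TACCA CNATH ACNAA YGAYG GNGC"),
    6: (26, "GGCAT NCCRA ARAGG ATHGA RAAYG C"),
    8: (29, "AGGGG CCATG GCCCA GCTCG CAGGC CAGC"),
    9: (28, "AAAAG GGATC CAAGG TCATC AGTCA AGG"),
    10: (30, "TGGTA CCATG GCAAA GTATT CCGAA CTCGA"),
    11: (27, "CGGAT CCGAT ATCAG CTATG ACCTT TA"),
    12: (29, "GTGAC CATGG GAAAG GCGCT GATGG TTCA"),
    13: (32, "CTAGG ATCCA AGTCT CTGGA TTATG TACTG GA"),
}

ALLOWED = set("ACGTNYHR")  # bases plus the degenerate codes used in the probes


def check_lengths(listing: dict) -> dict:
    """Return {seq_id: problem} for every entry that fails a check."""
    problems = {}
    for seq_id, (declared, text) in listing.items():
        seq = text.replace(" ", "")
        if not set(seq) <= ALLOWED:
            problems[seq_id] = "unexpected character"
        elif len(seq) != declared:
            problems[seq_id] = f"declared {declared}, found {len(seq)}"
    return problems


if __name__ == "__main__":
    print(check_lengths(LISTING) or "all entries consistent")
```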
SEQUENCE LISTING



(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: IMANAKA, TADAYUKI

(B) STREET: 2-28-11 FUJISHIRO-DAI

(C) CITY: OSAKA 

(E) COUNTRY: JAPAN 

(F) POSTAL CODE (ZIP): . 

(ii) TITLE OF INVENTION: A METHOD FOR PRODUCTION OF PROTEIN USING MOLECULAR CHAPERON.

(iii) NUMBER OF SEQUENCES: 14 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: EP 96306713.7 
(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 7-237176

(B) FILING DATE: 14-SEP-1995 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: JP 8-228252 

(B) FILING DATE: 29-AUG-1996



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GTATGCGGAT CCTGGGCGGC ATGATGTAAT CC 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GAGCTCGAAT TCCGAAGTAG TTTCTTCAAG TTGC 34

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AGTGCTCTAG AGAACGGCGA AAACTATCG
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

TTTTTGGATC CGGTTTATTA CATCATGCCG CC 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGGNGTACCA CNATHACNAA YGAYGGNGC 29
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



EP 0 774 512 A2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGCATNCCRA ARAGGATHGA RAAYGC 26 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1800 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 121..1761



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCTTTTAATC ATTACCGAAA ACTTTATAAA TAGCACAAAA GAACAATAGC GCGGAAAACA  60

CGAATTGTAA CTAAAACTCA TCCACCCTCA AAAACAAAAA AAGGGTGGGG GTGAGGGGAG  120

ATG GCC CAG CTC GCA GGC CAG CCA GTT GTT ATT CTG CCC GAG GGA ACC  168
Met Ala Gln Leu Ala Gly Gln Pro Val Val Ile Leu Pro Glu Gly Thr
1               5                   10                  15

CAG AGG TAT GTT GGA AGG GAC GCC CAG AGG CTC AAC ATT CTT GCT GCC  216
Gln Arg Tyr Val Gly Arg Asp Ala Gln Arg Leu Asn Ile Leu Ala Ala
            20                  25                  30

AGG ATT ATA GCC GAG ACG GTT AGA ACC ACC CTC GGT CCA AAG GGA ATG  264
Arg Ile Ile Ala Glu Thr Val Arg Thr Thr Leu Gly Pro Lys Gly Met
        35                  40                  45

GAC AAG ATG CTC GTT GAC AGC CTC GGC GAC ATC GTC ATC ACC AAC GAC  312
Asp Lys Met Leu Val Asp Ser Leu Gly Asp Ile Val Ile Thr Asn Asp
    50                  55                  60

GGT GCA ACC ATT CTC GAC GAG ATG GAC ATC CAG CAC CCT GCT GCT AAG  360
Gly Ala Thr Ile Leu Asp Glu Met Asp Ile Gln His Pro Ala Ala Lys
65                  70                  75                  80

ATG ATG GTT GAG GTT GCT AAG ACT CAG GAC AAG GAG GCC GGT GAC GGA  408
Met Met Val Glu Val Ala Lys Thr Gln Asp Lys Glu Ala Gly Asp Gly
                85                  90                  95

ACC ACC ACT GCC GTT GCC ATC GCC GGT GAG CTT CTG AGG AAG GCT GAG  456
Thr Thr Thr Ala Val Ala Ile Ala Gly Glu Leu Leu Arg Lys Ala Glu
            100                 105                 110

GAG CTT CTC GAC CAG AAC ATT CAC CCG AGC ATA ATC ATC AAG GGT TAC  504
Glu Leu Leu Asp Gln Asn Ile His Pro Ser Ile Ile Ile Lys Gly Tyr
        115                 120                 125

GCC CTC GCG GCA GAG AAA GCC CAG GAA ATA CTC GAC GAG ATA GCC AAG  552
Ala Leu Ala Ala Glu Lys Ala Gln Glu Ile Leu Asp Glu Ile Ala Lys
    130                 135                 140

GAC GTT GAC GTC GAG GAC AGG GAG ATT CTC AAG AAG GCC GCG GTC ACC  600
Asp Val Asp Val Glu Asp Arg Glu Ile Leu Lys Lys Ala Ala Val Thr
145                 150                 155                 160

TCC ATC ACC GGA AAG GCT GCC GAG GAG GAG AGG GAG TAC CTC GCT GAG  648
Ser Ile Thr Gly Lys Ala Ala Glu Glu Glu Arg Glu Tyr Leu Ala Glu
                165                 170                 175

ATA GCA GTT GAG GCC GTC AAG CAG GTT GCC GAG AAG GTT GGC GAG ACC  696
Ile Ala Val Glu Ala Val Lys Gln Val Ala Glu Lys Val Gly Glu Thr
            180                 185                 190

TAC AAG GTC GAC CTC GAC AAC ATC AAG TTC GAG AAG AAG GAA GGT GGA  744
Tyr Lys Val Asp Leu Asp Asn Ile Lys Phe Glu Lys Lys Glu Gly Gly
        195                 200                 205

AGC GTC AAG GAC ACC CAG CTC ATA AAG GGT GTC GTC ATC GAC AAG GAG  792
Ser Val Lys Asp Thr Gln Leu Ile Lys Gly Val Val Ile Asp Lys Glu
    210                 215                 220

GTC GTC CAC CCA GGC ATG CCG AAG AGG GTC GAG GGT GCT AAG ATC GCC  840
Val Val His Pro Gly Met Pro Lys Arg Val Glu Gly Ala Lys Ile Ala
225                 230                 235                 240

CTC ATC AAC GAG GCC CTC GAG GTC AAG GAG ACC GAG ACC GAC GCC GAG  888
Leu Ile Asn Glu Ala Leu Glu Val Lys Glu Thr Glu Thr Asp Ala Glu
                245                 250                 255

ATC AGG ATC ACC AGC CCG GAG CAG CTC CAG GCC TTC CTT GAG CAG GAG  936
Ile Arg Ile Thr Ser Pro Glu Gln Leu Gln Ala Phe Leu Glu Gln Glu
            260                 265                 270

GAG AAG ATG CTC AGG GAG ATG GTC GAC AAG ATC AAG GAG GTC GGC GCG  984
Glu Lys Met Leu Arg Glu Met Val Asp Lys Ile Lys Glu Val Gly Ala
        275                 280                 285

AAT GTC GTC TTC GTC CAG AAG GGC ATT GAC GAC CTC GCC CAG CAC TAC  1032
Asn Val Val Phe Val Gln Lys Gly Ile Asp Asp Leu Ala Gln His Tyr
    290                 295                 300

CTT GCC AAG TAC GGC ATA ATG GCC GTT AGA AGG GTC AAG AAG AGC GAC  1080
Leu Ala Lys Tyr Gly Ile Met Ala Val Arg Arg Val Lys Lys Ser Asp
305                 310                 315                 320

ATG GAG AAG CTC GCC AAG GCC ACC GGC GCC AAG ATC GTC ACC AAC GTC  1128
Met Glu Lys Leu Ala Lys Ala Thr Gly Ala Lys Ile Val Thr Asn Val
                325                 330                 335

CGC GAC CTC ACT CCG GAG GAC CTC GGT GAG GCC GAG CTC GTC GAC CAG  1176
Arg Asp Leu Thr Pro Glu Asp Leu Gly Glu Ala Glu Leu Val Asp Gln
            340                 345                 350

AGG AAG GTC GCC GGC GAG AAC ATG ATC TTC GTC GAG GGC TGC AAG AAC  1224
Arg Lys Val Ala Gly Glu Asn Met Ile Phe Val Glu Gly Cys Lys Asn
        355                 360                 365

CCG AAG GCC GTC ACA ATA CTC ATC AGG GGC GGC ACC GAG CAC GTC GTT  1272
Pro Lys Ala Val Thr Ile Leu Ile Arg Gly Gly Thr Glu His Val Val
    370                 375                 380

GAT GAG GTC GAG AGG GCC CTT GAG GAC GCC GTC AAG GTC GTC AAG GAC  1320
Asp Glu Val Glu Arg Ala Leu Glu Asp Ala Val Lys Val Val Lys Asp
385                 390                 395                 400

ATC GTC GAG GAC GGC AAG ATC GTC GCC GCC GGT GGT GCT CCG GAG ATC  1368
Ile Val Glu Asp Gly Lys Ile Val Ala Ala Gly Gly Ala Pro Glu Ile
                405                 410                 415

GAG CTC GCC ATC AGG CTC GAC GAG TAC GCG AAG GAG GTC GGC GGC AAG  1416
Glu Leu Ala Ile Arg Leu Asp Glu Tyr Ala Lys Glu Val Gly Gly Lys
            420                 425                 430

GAG CAG CTC GCC ATC GAG GCC TTT GCC GAG GCC CTC AAG GTC ATC CCG  1464
Glu Gln Leu Ala Ile Glu Ala Phe Ala Glu Ala Leu Lys Val Ile Pro
        435                 440                 445

AGG ACC CTC GCC GAG AAC GCC GGT CTC GAC CCG ATC GAG ACC CTC GTT  1512
Arg Thr Leu Ala Glu Asn Ala Gly Leu Asp Pro Ile Glu Thr Leu Val
    450                 455                 460

AAG GTC ATC GCC GCC CAC AAG GAG AAG GGA CCG ACC ATC GGT GTT GAC  1560
Lys Val Ile Ala Ala His Lys Glu Lys Gly Pro Thr Ile Gly Val Asp
465                 470                 475                 480

GTC TTC GAG GGC GAG CCG GCC GAC ATG CTC GAG CGC GGC GTT ATC GCC  1608
Val Phe Glu Gly Glu Pro Ala Asp Met Leu Glu Arg Gly Val Ile Ala
                485                 490                 495

CCG GTC AGG GTT CCG AAG CAG GCC ATC AAG AGC GCC AGC GAG GCT GCC  1656
Pro Val Arg Val Pro Lys Gln Ala Ile Lys Ser Ala Ser Glu Ala Ala
            500                 505                 510

ATA ATG ATC CTC AGG ATC GAC GAC GTC ATC GCC GCC AGC AAG CTC GAG  1704
Ile Met Ile Leu Arg Ile Asp Asp Val Ile Ala Ala Ser Lys Leu Glu
        515                 520                 525

AAG GAC AAG GAG GGC GGC AAG GGC GGT AGC GAG GAT TTC GGA AGC GAC  1752
Lys Asp Lys Glu Gly Gly Lys Gly Gly Ser Glu Asp Phe Gly Ser Asp
    530                 535                 540

CTT GAC TGA AGCCCTTTGA TTTCTTTTCT CTTCAAATTT GTGTTCTTA  1800
Leu Asp *
545
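
The feature location 121..1761 and the protein length can be related by simple codon arithmetic, and the ends of the coding sequence can be spot-checked by translation. The sketch below assumes the standard genetic code; `translate` and `cds_summary` are hypothetical helpers, and only short fragments from the two ends of the coding region are translated.

```python
# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces ignored) in frame 1, one letter per codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def cds_summary(start: int, end: int) -> tuple:
    """Return (length in nt, number of codons, leftover nt) for a 1-based CDS range."""
    length = end - start + 1
    codons, remainder = divmod(length, 3)
    return length, codons, remainder


if __name__ == "__main__":
    print(cds_summary(121, 1761))
    print(translate("ATG GCC CAG CTC GCA GGC CAG CCA"))
    print(translate("CTT GAC TGA"))
```

With the stop codon counted as one of the codons in the 121..1761 range, the range holds one more codon than the 546 residues of the deduced protein.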



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Ala Gln Leu Ala Gly Gln Pro Val Val Ile Leu Pro Glu Gly Thr
1               5                   10                  15

Gln Arg Tyr Val Gly Arg Asp Ala Gln Arg Leu Asn Ile Leu Ala Ala
            20                  25                  30

Arg Ile Ile Ala Glu Thr Val Arg Thr Thr Leu Gly Pro Lys Gly Met
        35                  40                  45

Asp Lys Met Leu Val Asp Ser Leu Gly Asp Ile Val Ile Thr Asn Asp
    50                  55                  60

Gly Ala Thr Ile Leu Asp Glu Met Asp Ile Gln His Pro Ala Ala Lys
65                  70                  75                  80

Met Met Val Glu Val Ala Lys Thr Gln Asp Lys Glu Ala Gly Asp Gly
                85                  90                  95

Thr Thr Thr Ala Val Ala Ile Ala Gly Glu Leu Leu Arg Lys Ala Glu
            100                 105                 110

Glu Leu Leu Asp Gln Asn Ile His Pro Ser Ile Ile Ile Lys Gly Tyr
        115                 120                 125

Ala Leu Ala Ala Glu Lys Ala Gln Glu Ile Leu Asp Glu Ile Ala Lys
    130                 135                 140

Asp Val Asp Val Glu Asp Arg Glu Ile Leu Lys Lys Ala Ala Val Thr
145                 150                 155                 160

Ser Ile Thr Gly Lys Ala Ala Glu Glu Glu Arg Glu Tyr Leu Ala Glu
                165                 170                 175

Ile Ala Val Glu Ala Val Lys Gln Val Ala Glu Lys Val Gly Glu Thr
            180                 185                 190

Tyr Lys Val Asp Leu Asp Asn Ile Lys Phe Glu Lys Lys Glu Gly Gly
        195                 200                 205

Ser Val Lys Asp Thr Gln Leu Ile Lys Gly Val Val Ile Asp Lys Glu
    210                 215                 220

Val Val His Pro Gly Met Pro Lys Arg Val Glu Gly Ala Lys Ile Ala
225                 230                 235                 240

Leu Ile Asn Glu Ala Leu Glu Val Lys Glu Thr Glu Thr Asp Ala Glu
                245                 250                 255

Ile Arg Ile Thr Ser Pro Glu Gln Leu Gln Ala Phe Leu Glu Gln Glu
            260                 265                 270

Glu Lys Met Leu Arg Glu Met Val Asp Lys Ile Lys Glu Val Gly Ala
        275                 280                 285

Asn Val Val Phe Val Gln Lys Gly Ile Asp Asp Leu Ala Gln His Tyr
    290                 295                 300

Leu Ala Lys Tyr Gly Ile Met Ala Val Arg Arg Val Lys Lys Ser Asp
305                 310                 315                 320

Met Glu Lys Leu Ala Lys Ala Thr Gly Ala Lys Ile Val Thr Asn Val
                325                 330                 335

Arg Asp Leu Thr Pro Glu Asp Leu Gly Glu Ala Glu Leu Val Asp Gln
            340                 345                 350

Arg Lys Val Ala Gly Glu Asn Met Ile Phe Val Glu Gly Cys Lys Asn
        355                 360                 365

Pro Lys Ala Val Thr Ile Leu Ile Arg Gly Gly Thr Glu His Val Val
    370                 375                 380

Asp Glu Val Glu Arg Ala Leu Glu Asp Ala Val Lys Val Val Lys Asp
385                 390                 395                 400

Ile Val Glu Asp Gly Lys Ile Val Ala Ala Gly Gly Ala Pro Glu Ile
                405                 410                 415

Glu Leu Ala Ile Arg Leu Asp Glu Tyr Ala Lys Glu Val Gly Gly Lys
            420                 425                 430

Glu Gln Leu Ala Ile Glu Ala Phe Ala Glu Ala Leu Lys Val Ile Pro
        435                 440                 445

Arg Thr Leu Ala Glu Asn Ala Gly Leu Asp Pro Ile Glu Thr Leu Val
    450                 455                 460

Lys Val Ile Ala Ala His Lys Glu Lys Gly Pro Thr Ile Gly Val Asp
465                 470                 475                 480

Val Phe Glu Gly Glu Pro Ala Asp Met Leu Glu Arg Gly Val Ile Ala
                485                 490                 495

Pro Val Arg Val Pro Lys Gln Ala Ile Lys Ser Ala Ser Glu Ala Ala
            500                 505                 510

Ile Met Ile Leu Arg Ile Asp Asp Val Ile Ala Ala Ser Lys Leu Glu
        515                 520                 525

Lys Asp Lys Glu Gly Gly Lys Gly Gly Ser Glu Asp Phe Gly Ser Asp
    530                 535                 540

Leu Asp *



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AGGGGCCATG GCCCAGCTCG CAGGCCAGC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
AAAAGGGATC CAAGGTCATC AGTCAAGG 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
TGGTACCATG GCAAAGTATT CCGAACTCGA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
CGGATCCGAT ATCAGCTATG ACCTTTA

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGACCATGG GAAAGGCGCT GATGGTTCA


(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CTAGGATCCA AQTCTCTGGA ITATGTACTG GA 



Claims

1. An expression cassette which can express a desired protein in a cell, wherein the cassette comprises a sequence
in which a gene encoding a molecular chaperon is operably linked to a first promoter and an insertion site into
which a gene encoding the desired protein can be inserted.

2. An expression cassette as claimed in claim 1, wherein the cell is a bacterial cell and the desired protein is expressed
in a soluble form when, in the absence of the molecular chaperon, the desired protein would be expressed as an
insoluble protein.

3. An expression cassette as claimed in any of the preceding claims, wherein the cassette has a second promoter,
which is upstream of the insertion site.

4. An expression cassette as claimed in any of the preceding claims, wherein the cassette has a terminator sequence
downstream of the gene encoding the molecular chaperon and downstream of the insertion site.

5. An expression cassette as claimed in any of the preceding claims, wherein the gene encoding the molecular
chaperon is a heat shock protein gene of a hyperthermophilic archaeon KOD-1.

6. An expression cassette as claimed in any of claims 1 to 4, wherein the gene encoding the molecular chaperon is
a GroESL gene of Bacillus stearothermophilus SIC1.

7. An expression cassette as claimed in any of the preceding claims, wherein the first and/or the second promoter is
a T7 promoter.

8. An expression vector comprising an expression cassette as claimed in any of claims 1 to 7, in which a gene 
encoding the desired protein is operably incorporated into the insertion site. 

9. A cell which can express a desired protein, wherein the cell is transformed with an expression cassette as claimed
in any of claims 1 to 7 or an expression vector comprising the expression cassette as claimed in claim 8.

10. A cell which can express a desired protein, wherein the host cell is co-transformed with a vector which can express 
a gene encoding a molecular chaperon and a vector which can express a sequence encoding the desired protein. 

11. The cell of claim 10, wherein the cell is a bacterial cell and the gene encoding the molecular chaperon is a heat
shock protein gene of a hyperthermophilic archaeon KOD-1.

12. The cell of claim 10, wherein the cell is a bacterial cell and the gene encoding the molecular chaperon is a GroESL
gene of Bacillus stearothermophilus SIC1.

13. A method of expressing a desired protein, the method comprising a step of culturing a cell which can co-express
a gene encoding a molecular chaperon and a gene encoding the desired protein.

14. A method as claimed in claim 13, wherein the cell is transformed with a vector having a gene encoding a molecular
chaperon which is operably linked to a first promoter and a gene encoding the desired protein which is operably
linked to a second promoter.

15. The method as claimed in claim 13, wherein the cell is co-transformed with a vector which can express a gene
encoding a molecular chaperon and a vector which can express a sequence encoding the desired protein.

16. A method as claimed in claim 13, wherein the cell is a bacterial cell and the desired protein is expressed in a
soluble form when, in the absence of the molecular chaperon, the desired protein would be expressed as an
insoluble protein.

17. A method as claimed in claim 16, wherein the gene encoding the molecular chaperon is a heat shock protein gene
of a hyperthermophilic archaeon KOD-1.

18. A method as claimed in claim 16, wherein the gene encoding the molecular chaperon is a GroESL gene of Bacillus
stearothermophilus SIC1.

19. A method of expressing a desired protein comprising:

culturing a cell comprising an expression cassette as claimed in any of claims 1 to 7 or an expression vector
as claimed in claim 8;
co-expressing the molecular chaperon and the desired protein;
heating the cell culture or a fraction containing the desired protein;
separating an insoluble fraction; and
recovering the desired protein.

20. A method as claimed in claim 19, wherein the cell is transformed with a vector in which the gene encoding the 
molecular chaperon is operably linked to the first promoter and the gene encoding the desired protein is operably 
linked to the second promoter. 

21. A method of expressing a desired protein comprising:

culturing a cell which has been co-transformed with a vector which can express a gene encoding a molecular
chaperon and a vector which can express a sequence encoding a desired protein;
co-expressing the molecular chaperon and the desired protein;
heating the cell culture or a fraction containing the desired protein;
separating an insoluble fraction; and
recovering the desired protein.

22. A method of changing a heat labile protein to a heat stable protein comprising mixing the heat labile protein with 
a heat stable molecular chaperon. 

23. A method of purifying a heat labile protein comprising: 


mixing the heat labile protein with a heat stable molecular chaperon; and 
heating the mixture. 

24. A KOD-1 heat shock protein comprising an amino acid sequence of SEQ ID NO: 7.

25. A gene encoding a KOD-1 heat shock protein comprising an amino acid sequence of SEQ ID NO: 7.


